Minh N. Do

Are you SURE? Enhancing Multimodal Pretraining with Missing Modalities through Uncertainty Estimation

Apr 18, 2025
Via arXiv

Leveraging Perfect Multimodal Alignment and Gaussian Assumptions for Cross-modal Transfer

Mar 19, 2025
Via arXiv

GIST: Towards Photorealistic Style Transfer via Multiscale Geometric Representations

Dec 03, 2024
Via arXiv

R.I.P.: A Simple Black-box Attack on Continual Test-time Adaptation

Dec 02, 2024
Via arXiv

Improving the Robustness of 3D Human Pose Estimation: A Benchmark and Learning from Noisy Input

Dec 11, 2023
Via arXiv

Persistent Test-time Adaptation in Episodic Testing Scenarios

Nov 30, 2023
Via arXiv

Making Vision Transformers Truly Shift-Equivariant

May 25, 2023
Via arXiv

MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation

Mar 09, 2023
Via arXiv

FedDCT: Federated Learning of Large Convolutional Neural Networks on Resource Constrained Devices using Divide and Co-Training

Nov 20, 2022
Via arXiv

Enhancing Few-shot Image Classification with Cosine Transformer

Nov 16, 2022
Via arXiv